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Abstract 

The vertebrate immune system is a wonder of modern evolution. Occasionally, how- 
ever, correlations within the immune system lead to inappropriate recruitment of 
preexisting T cells against novel viral diseases. We present a random energy the- 
ory for the correlations in the naive and memory T cell immune responses. The 
non-linear susceptibility of the random energy model to structural changes captures 
the correlations in the immune response to mutated antigens. We show how the 
sequence-level diversity of the T cell repertoire drives the dynamics of the immune 
response against mutated viral antigens. 
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1 Introduction 



In this work, we develop a random energy model that allows us to address 
limitations in the cellular immune system response to mutable viral diseases. 
We focus on the T cell response, with an emphasis on cytotoxic lympho- 
cyte (CTL) T cells, which bind the peptide-major histocompatability class I 
(MHCI) complex [1]. These T cells eradicate other infected cells through an 
interaction mediated by the binding of the T cell receptor (TCR) to antigenic 
peptides derived from the infectious disease. The finite number of different T 
cells within an individual and the existence of immune system memory makes 
the immune system response to mutating viral diseases nontrivial. We seek 
to understand how the T cell receptor repertoire changes within the immune 
system in response to repeated exposure to evolving viral diseases. 
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The immune system has a mechanism for selecting among the many possible 
TCRs those that best bind to the antigenic peptides. TCRs are constructed 
from modular elements, and each individual has an approximate diversity of 
2.5 x 10 7 different receptors [2]. TCRs undergo rounds of selection for in- 
creased affinity [3]. TCRs do not undergo any further mutation during the 
immune response. Those TCRs that are stochastically selected in the primary 
response to a disease become memory cells [4]. While many details affect the 
T cell selection process [5,6], selection for increased affinity is thought to be a 
dominant factor [7]. 

Several limitations of the cellular immune system have been reported that 
stem from the fact that a single T cell receptor may bind multiple antigenic 
peptides. This cross-reactivity is studied in quantitative molecular experiments 
through the use of altered peptide ligands (APLs), peptides that differ from 
the native ligand of the TCR by one amino acid [8]. We here develop a theory 
of the T cell response to altered peptide ligands. In section 2, we introduce the 
random energy model of the interaction between the TCR and the peptide- 
MHC complex. In section 3, we describe the process by which T cells that 
bind antigen are selected. In section 4, we show how experiments with altered 
peptide ligands may be understood with the random energy model. We discuss 
a critical point in the immune response probability in section 5. In section 6, 
we discuss correlations that may arise in forward and reverse experiments with 
altered peptide ligands. We conclude in section 7. 



2 The Random Energy Model 

The important molecular components of the T cell recognition event are the 
peptide-MHCI complex and the T cell receptor (TCR), as shown in Figure 1. 
Typically, the peptide ligand is on the order of 9 amino acids long. The TCR 
variable region is composed of 54 amino acids, grouped into 6 subdomains. 
The TCR sits atop the peptide-MHCI complex during recognition. Recogni- 
tion of specific immunogenic peptides is due to the interaction between the 
subdomains of the TCR and the peptide. The majority of the amino acids 
in the peptide are directly recognized by 1-4 amino acids of the TCR, with 
typically 3 amino acids in the peptide being hot spots of the binding [9]. The 
entire TCR variable region interacts relatively non-specifically with the MHCI 
complex. There is also a relatively generic interaction between the peptide and 
the MHCI complex. 

There are too many atoms within the TCR-peptide-MHCI complex and too 
many TCRs within an individual to treat this problem in full atomistic detail. 
We therefore use a type of random energy model to represent the interactions 
between the TCR, peptide, and MHCI complex. This random energy builds 
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a hierarchy of correlations into Derrida's random energy theory [10]. The im- 
mune system being a real-time example of an evolving system, the random 
energy approach is a natural one [11]. Our random energy model is a gener- 
alization of Kauffman's NK model [12] to include correlations due to protein 
secondary structure [13,14]. The model introduced here to account for the T 
cell immune response contains an additional term relative to that which de- 
scribes the B cell immune response [15]. In detail, our generalized NK model 
for the T cell response considers four different kinds of interactions: interac- 
tion within a subdomain of the TCR (£/ sd ), interactions between subdomains 
of the TCR (£/ sd_sd ), indirect interactions between the TCR and the peptide 
(C/ pep-sd ), and direct binding interaction between the TCR and peptide (U c ). 
In this model, interactions between the peptide or TCR and the MHCI have 
been integrated out. As is typical in statistical mechanics [16], integrating 
out these terms produces random interactions in the Hamiltonian, and we as- 
sume these to be roughly of the form of the other intermolecular terms in our 
model. In the altered peptide ligand experiments considered here, the pep- 
tides, whether original or altered, are typically expressed by the same MHCI. 
The first three terms of the present random energy model are identical to 
those used to model protein evolution [14] and the B cell antibody response 
[15]. The parameters within the model have been determined either by earlier 
work [12,13] or by structural biology [14]. The energy function of the TCR 
with peptide-MHCI is 

MM M N h N CON 
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where M = 6 is the number of TCR secondary structural subdomains, iV b = 3 
is the number of hot-spot amino acids that directly bind to the TCR, and 
-^con = 3 is the number of T cell amino acids contributing directly to the 
binding of each peptide amino acid. The subdomain energy U sd is 

^ N-K+l 
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where N = 9 is the number of amino acids in a subdomain, and K = 4 is the 
range of local interaction within a subdomain. The prefactor for this term, and 
the other three terms in Eq. 1, is chosen so that random sequences produce a 
unit variance of this term. Thus, each term in Eq. 1 contributes roughly equal 
weight, a priori. All subdomains belong to one of L = 5 different types (e.g., 
helices, strands, loops, turns, and others). The quenched Gaussian random 
number a ai is different for each value of its argument for a given subdomain 
type, ccj. All a values in the model are Gaussian random numbers and have 
zero mean and unit variance. The sigma values are different for each value 
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of the argument, subscript, or superscript. The variable on defines the type 
of secondary structure for the i th subdomain, 1 < aij < L. The energy of 
interaction between secondary structures is 



j rsd— sd / 

ij ' V DM(M - 1) 
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(3) 



We set the number of interactions between secondary structures at D — 2, 
as the TCR-peptide-MHCI interaction is a slightly less rough than is the 
antibody- antigen interaction landscape where we have used D = 6 [15]. Here 
afj and the interacting amino acids, ji, ■ ■ -,jK, are selected at random for each 
interaction (i,j,k). The indirect interaction energy between the peptide and 
the TCR is given by 



rjpcp-sd 7 1 y k( pep . . . pep i _ _ _ i \ / 4 N 
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Here <rf and the interacting amino acids, ji, ■ ■ -,jK, are selected at random 
in the peptide and TCR subdomain for each interaction (i,k). The chemical 
binding energy of each TCR amino acid to the peptide is given by 

U!j= /N \ MO J2 )- (5) 
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The contributing amino acids, j\,j2, and the unit-normal weight of the bind- 
ing, dij, are chosen at random for each interaction with iV b possible 
values for ji, and NM possible values for j 2 . 

Although there are 20 different naturally occurring amino acids, there are only 
roughly five distinct classes. Mutations that change an amino acid to another 
within the same class are termed conservative, whereas mutations that change 
the class of the amino acid are termed non-conservative. This distinction is 
significant because non- conservative mutations change the energy landscape 
more dramatically than do conservative mutations. Non-conservative muta- 
tions lead to a factor of 2.34 w (1 + 1/2 2 ) 1 / 2 / (1/2) greater energy change than 
do conservative mutations, on average as estimated from statistically deter- 
mined energy values from the protein sequence alignment matrix (PAM) [17]. 
To consider all 20 amino acids within the random energy model, Eq. 1, and to 
consider the differing effects of conservative and non- conservative mutations, 
we set the random a for amino acid i that belongs to group j as a = Wj + Wi/2, 
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where the w are Gaussian random numbers with zero average and unit stan- 
dard deviation. There are 5 groups, with 8 amino acids in the neutral and 
polar plus cystein group, 2 amino acids in the negative and polar group, 3 
amino acids in the positive and polar group, 4 amino acids in the nonpolar 
without ring group, and 3 amino acids in the nonpolar with ring group. The 
results are not greatly sensitive to the precise groupings of the amino acids. 

The naive TCR repertoire is generated randomly from gene fragments. This is 
accomplished by constructing the TCRs from subdomain pools. Fragments for 
each of the L subdomain types are chosen randomly from 5 of the 100 lowest 
energy subdomain sequences. This diversity mimics the known TCR diversity, 
(5 x L) M 10 s . There have been recent suggestions that the possible diversity 
of a given individual's TCR repertoire is ~ 10 11 , with the actual diversity 
expressed at any point in time being ps 10 8 [18]. These numbers, which are 
very much in line with the numbers for human antibodies, would imply L = 13. 
The results for the present study are not greatly sensitive to whether L = 5 
or L = 13. 

The binding constants between the peptide-MHCI ligand and the TCR are 
calibrated by fixing the worst and geometric mean of the binding constants 
evolved during a primary response to be 3 x 10 5 1/mol and 3 x 10 6 1/mol, 
respectively, where the binding constant is related to the energy by 

k = e a ~ bU . (6) 

That is, fixing the worst and geometric mean of the binding constants to 
these values determines the values of a and b for each instance of the random 
parameters with in the generalized NK model. This approach leads to the best 
binding constant being on average 3 x 10 7 1/mol. These binding constants are 
taken from experiment [19] and are slightly smaller than those for antibodies. 
Whenever the peptide substrate is changed, the constants a and b in Eq. 6 
are reevaluated by comparison to the existing naive TCR repertoire. All terms 
from Eq. 1 are significant, because which of the TCRs from the repertoire best 
bind depends on the identity of the peptide. 



3 The T Cell Maturation Process 

The T-cell-mediated response is driven by cycles of concentration expansion 
and selection for better binding constants. Selection processes are ubiquitous 
in nature [20], although the exact mechanism of the T cell expansion during 
the primary immune response remains elusive. It is clear that the expansion 
of the T cells is non-linear [3]. It also seems that, in most cases, there is 
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competition among the T cells for the presented antigen [7,21]. Since the T cells 
do not mutate during the primary response, our model of the primary response 
dynamics to be detailed below can be viewed as one particular description of 
the non-linear expansion of the naive T cell repertoire. The primary response 
increases the concentration of selected TCRs by 1000 fold over 10 days, with a 
rough T cell doubling time of one day. The diversity of the memory sequences 
is 0.5% of that of the naive repertoire [2]. The secondary response increases 
the concentration of the selected sequences by 10 fold over a few days. For 
humans, there are roughly 2.5 x 10 7 distinct T cell sequences at a copy number 
of 2.4 x 10 4 in the naive repertoire and roughly 1.5 x 10 5 sequences at a copy 
number of 2 x 10 6 in the memory repertoire [2]. Typical best and worst binding 
constants of the memory repertoire are 10 7 1/mol and 10 5 1/mol, respectively 
[19]. 

We implement a selection model of the T cell immune system maturation. 
Specifically, 10 rounds of selection are performed during the primary response, 
with the top x = 58% of the sequences chosen at each round. That is, the 
probability of picking one of the sequences for the next round is 



select 



jr^r — , for U < U* 

U.58iV s i ze ' — * /rj\ 

0, for U>U* 



where U* is the energy for which 58% of the sequences lie below. This equation 
is employed N sizc times, where iV size is the diversity of the T cell repertoire, 
i. e. the number of distinct T cell present within the immune system, to select 
randomly the A^ size sequences for the next round. This procedure mimics the 
concentration expansion factor of 10 3 ~ 2 10 in the primary response and leads 
to 0.5% diversity of the memory repertoire, because 0.58 10 ~ 0.5% and 10 
days of doubling leads to a concentration expansion of 2 10 = 1024. 

As discussed in more detail in section 4, the quality of the primary immune 
response is often measured experimentally by an in vitro or ex vivo assay. For 
an in vivo secondary response, we calculate the average binding constants of 
the memory and naive responses, and whichever is larger determines whether 
the response will be from the memory or naive repertoires. In fact, the memory 
binding constant is multiplied by a factor of 100, as memory T cells are present 
at higher concentrations and more broadly present in the tissue than are naive 
T cells. If the memory cells are used in the secondary in vivo response, the top 
x = 58% of the sequences are chosen, and 3 rounds of selection are performed 
[3,4]. This mimics the concentration factor of 10 ~ 2 3 during the secondary 
memory response. Conversely, if the naive cells are used in the secondary in 
vivo response, the dynamics is identical to that of the primary response. For 
an in vitro secondary response, in which memory T cells are extracted and 
stimulated in an in vitro experiment, 3 rounds of selection are performed, 



6 



starting with exclusively memory sequences. 

Implementation of our theory proceeds by computational simulation of the 
generalized NK model. First, the peptide and the altered peptide ligands are 
created. Then, the random terms such as the secondary structural types of the 
subdomains, the a values, the binding sites, and the interaction sites of the 
generalized NK model are determined. Then the fraction and identity of the T 
cell repertoire that responds well to the peptide ligand is identified. Typically, 
1 in 10 5 T cells responds well to any particular ligand, and one thus expects 
on the order of 10 8 /10 5 = 1000 T cells to participate in the naive response. 
Then the values of the constants a and b in Eq. 6 are calculated. The primary 
response of 10 rounds of selection is then carried out. Finally, the secondary 
response, either 10 rounds from the naive pool or 3 rounds from the memory 
pool, depending on the relative binding constants for the altered peptide is 
carried out. 



4 Experiments with Altered Peptide Ligands 

A fundamental way to measure the correlation between immune responses to 
related antigens is to use altered peptide ligands (APLs). In such experiments, 
an immune response is first generated to the original peptide. The peptide 
antigen is then changed, typically either conservatively or non-conservatively 
and only at one amino acid position [22]. The immune response to this al- 
tered peptide antigen is then measured. Intuitively, one expects that if the 
peptide antigen is altered only slightly, the immune response to the APL will 
be relatively high, whereas if the peptide is altered significantly, the immune 
response to the APL will be rather low. 

Specific lysis is a measure of the probability that an activated T cell will 
recognize an antigen presenting cell that is expressing a particular peptide- 
MHCI complex. This quantity is measured as a function of the effector to 
target ratio, E /T , the ratio of the number of T cells to the number of antigen 
presenting cells. Each T cell typically expresses on the order of 10 5 identical 
TCRs, and each antigen presenting cell typically expresses on the order of 
2 x 10 4 peptide-MHCI complexes [19]. In typical experiments, the specific 
lysis is measured over a 4 hour time period [23]. Although T cells can each 
kill many targets in vivo, the experimental specific lysis assay requires the 
number of of T cells to be greater than the number of targets for detectable 
killing. Lysis is a statistical event, being on average roughly proportional to 
the probability of a T cell binding the target cell. To calculate the specific 
lysis curve, we calculate the amount of the target cells that are bound by all 
T cells. We do this by considering the equilibrium between the effector T cells 
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and the target antigen presenting cells: 



E t + T # E t T, Ki . (8) 

This equation implies [E^T] /([£*] [T]) = Ki. Introducing the amount of antigen 
presenting cells that are bound by T cell i, £j, and noting that the effector 
concentration is typically higher than the concentration of antigen presenting 
cells, we find 

& = AV?°(T -£&) • (9) 



Summing over all T cells, we find for the total amount of antigen presenting 
cells bound by any T cell: 

• ™° (10) 



where L = X^Ci/^o is the specific lysis. Writing this in the form 

= zE /T 
1 + zE /T ' 



'11) 



we find z = T J2iKiE^/E = T (K). Thus, the competitive binding pro- 
cess for the antigen presenting cells can be viewed as a Langmuir adsorption 
isotherm of the T cells onto the antigen presenting cells. 

The binding constant Ki is that between the two cells. Assuming that the 
free energies of binding for each of the TCR/peptide-MHCI interactions are 
approximately additive, we find Ki = 10 5 x 2 x 10 4 /cj, where ki is the molecular 
binding constant between TCR % and the peptide-MHCI complex. Typical 
values of the target cell concentration are 2 x 10 3 target cells in 200 /il of 
solution, or T = 1.66 x 10~ 17 M [23]. We, thus, find 

' = 3^- (12) 



Interestingly, the maximum value of the binding constant in Eq. 6 is of the 
same order of magnitude as the denominator in Eq. 12. For convenience, we 
have chosen them to be identical. Adding in some cooperativity in the binding, 
i.e. assuming that the free energies of binding for each of the TCR/peptide- 
MHCI interactions are not entirely additive, will only change details such as 
the denominator in Eq. 12. 
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At infinite dilution of the T cells, we find that the average number of antigen 
presenting cells that are lysed by one T cell is LT Q /E = z. The quantity 
z is, therefore, the average clearance probability of one T cell. Since the T 
cell typically can lyse no more than one target cell in specific lysis assays, we 
modify the definition of the clearance probability as 

A/siz e 

z = — J2mm(l,k i /3xl0 7 ). (13) 

-''size i = \ 



This equation is implemented by introducing a lower cutoff in the energy 
according to 3 x 10 7 = exp(a — bU m \ n ). 

The immune response probability is a measure of the probability that the 
clearance probability is greater than 50%. In other words, the immune response 
probability is the average of H(L — 1/2), where H(x) = 0,x < and H(x) = 
l,x > 0. Both the specific lysis and the immune response probability are 
averaged over many instances of the random energy model, typically 10 4 . 

Both specific lysis and immune response probability can be measured exper- 
imentally either in vitro or ex vivo. Typically these responses are measured 
after the primary response to the original peptide ligand and before a true 
secondary response to the altered peptide ligand. Experimentally, the ex vivo 
response is measured by preparing mice, immunizing with the virus, and after 
8-10 days removing the spleen of the mouse. The T cells from the spleen are 
then challenged with antigen presenting cells expressing the specific altered 
peptide ligand. The spleen in this case contains both memory and naive T 
cells. For the ex vivo APL response, we use A sizc /2 distinct memory cells and 
Assize distinct naive cells, where A size is the diversity of the naive T cell reper- 
toire, to calculate the observables [2]. Experimentally, the in vitro response is 
measured by removing the spleen of the mouse, immunizing the spleen cells 
with the virus, and after 8-10 days challenging those spleen cells with antigen 
presenting cells expressing the specific altered peptide ligand. The spleen in 
this case contains predominantly memory cells. For the in vitro APL response, 
we use exclusively A size memory cells to calculate the observables. 

We use this theory to analyze the correlations in the immune response that are 
measured in altered peptide ligand experiments. The non-linear susceptibility 
of the random energy model to structural changes allows us to capture the 
correlations in the immune response to altered peptide antigens. An extensive 
set of quantitative experiments on altered peptide ligands has been carried out 
on the mouse model viral disease lymphocytic choriomeningitis virus (LCMV). 
Due to the particularly strong immunogenicity of LCMV, the memory T cell 
population is comprised essentially exclusively of T cells from the primary 
LCMV response, in contrast to the more typical case where 1-10% of the 
memory T cell population is specific for a particular disease [24]. Shown in 
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Figure 2 is a comparison between the measured and calculated ex vivo and 
in vitro responses to altered peptide ligands with a single non- conservative 
mutation. Experiments have been carried out with conservative mutations 
as well, and shown in Figure 3 is a comparison between the measured and 
calculated ex vivo and in vitro responses to altered peptide ligands with a 
single conservative mutation. 

The in vitro response is always greater than the ex vivo, because a purely 
memory response is better than a naive response for these peptides altered by 
one amino acid. The responses are slightly superior for the conservative mu- 
tations than for the non- conservative mutations, because in the conservative 
case the altered peptide is more similar to the original peptide upon which 
the memory sequences were evolved. 

Immune response probabilities have not been widely measured. In a classic 
study, Klenerman and Zinkernagel found that 1 out of 7 non-conservatively 
altered peptides produced a response to LCMV [22]. This data point is shown 
in Figure 4. Also shown are the ex vivo and in vitro immune response prob- 
abilities to altered peptide ligands with a single non-conservative mutation. 
Shown in Figure 5 are the ex vivo and in vitro immune response probabilities 
to altered peptide ligands with a single conservative mutation. 

The in vitro response remains superior to the ex vivo response, as the pep- 
tides with a single amino acid mutation are best recognized by the mem- 
ory sequences. In addition, the conservative response is stronger than the 
non- conservative response, because the conservatively mutated peptides are 
more similar to the native target of the memory sequences than are the non- 
conservatively mutated peptides. 



5 A Critical Point in the Immune Response Probability 

It can be seen that larger repertoire sizes lead to a sharpening of the immune 
response probability curve. In fact, there is a critical point in our model which 
occurs at E /T = 4.6 for the in vitro and E /T Q = 16 for the ex vivo case. The 
critical immune response probability lies in the range 0.14-0.23. The critical 
value of Eq/Tq is shifted to a higher value for the ex vivo case because the 
average energies are not as favorable in this case. Randomness in the ensemble, 
due to the energy fluctuations, causes the immune response probability in the 
infinite repertoire limit to be a smooth curve, rather than a step function. Such 
randomness would correspond, for example, to the variability in an individual's 
response to a variety of disease strains or the variability in the response of a 
population of individuals to a specific disease strain. In our model, there is 
an additional source of randomness for a finite repertoire size, the inexact 
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evaluation of the constants a and b in Eq. 6. Assuming that the variation 
in a is more significant and is Gaussian, we can roughly say z = exp(a + 
aa — bU), where a is a Gaussian with unit variance and zero mean, a = 
0(l/\/N sizc ) is the rough size in the error of the calculation of a, and U is 
random with a certain probability distribution. We calculate the contribution 
of the randomness in a to the immune response probability, F, as 



OF _ dH dL 
da dL da 



, ,dL dz . 
\(°\L = l/2) 



= i J daP{a)a J duP{u)5{L - 1/2) 

= \ I daP ( a } aP ( u *^ xc Ma+av-bu*)=i (14) 
Assuming that the randomness in the energy is also Gaussian, we find 



x a ~ Hu) t^ X (15) 

where x = ({ u — ( u )) 2 )- The fixed point occurs for x* = exp(— a + b(u)) and 
F* = 1/2. The deviation of the immune response probability from the infinite 
repertoire size limit is 0(a 2 ) = 0(1/N size ). The shape of this curve is very 
similar to that seen in Figures 4 and 5. If we make a more detailed analysis 
that takes into account the negative asymmetry of the probability distribution 
for Inz, which is due to the cutoff in Eq. (13), we find that F* is lowered from 
1/2. The ex vivo probability distribution for In z is more symmetric than is 
the in vitro probability distribution, and so the ex vivo fixed point should be 
higher than the in vitro fixed point. In addition, the conservative probability 
distribution for In z has a smaller negative tail than does the non- conservative 
distribution, and so the conservative fixed point should be higher than the 
non- conservative fixed point. These predictions are in agreement with the 
observations in Figures 4 and 5. 



6 Correlation in Forward and Reverse APL Experiments 



Using our random energy model, it is possible to examine how correlations in 
the forward altered peptide ligand experiment give rise to altered correlations 
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in the reverse altered peptide ligand experiment. For example in Ref. [22], one 
set of experiments was performed with LCMV wild-type as the original peptide 
and LCMV-8.7 as the altered peptide ligand, and another set of experiments 
was performed with LCMV-8.7 as the original peptide and LCMV wild type 
as the altered peptide ligand. In Figure 6, we show these data as well as a 
representative set of curves from our theory. 

A more systematic way to study such forward and reverse experiments is to 
look at the correlation between a response in the forward original peptide 
ligand (OPL) — > APL experiment and the reverse APL — > OPL experiment. 
We introduce the correlation matrix 



C 



{z A z B } (z A (l - Z B )) ^ 

\({l-z A )z B ) ((1 - z A )(l - Z B )) j 



(16) 



Here z A is the in vitro clearance probability of the APL in the forward exper- 
iment where the memory sequences were evolved against the OPL, and z B is 
the in vitro clearance probability of the OPL in the reverse experiment, where 
the memory sequences were evolved against the APL. The averages are taken 
over instances of the random energy model. Although the average response 
is the same for the forward and reverse experiments, (z A ) = (z B ), there may 
be a correlation between the two responses. To specify the notation, we note 
that the probability of a forward response is given by P(A) = (z A ) and that 
of a reverse response by P(B) = (z B ). The probability of both a forward 
and reverse response is given by P(A, B) = (z A z B ) = C\\. The conditional 
probability that the reverse experiment is successful, given that the forward 
experiment is successful, is 

P(B\A) - P(AB) - P{AB) - ^ (17) 
P[m) ~ P(A) ~ P{B) ~(z A )- (17) 



From our model, we find (z A ) = 0.113 and C\\ = 0.01651, and thus P(B\A) = 
0.146. These values are not greatly sensitive to the repertoire size. The cross 
correlation is given by (AC) 2 = C n - P(A)P(B) = 0.003797. The cross 
correlation can alternatively be expressed as 

P(B\A) ( AC \ 2 



If there were no correlation between the two experiments, we would find 
P(B\A)/ P(A) = 1. If there were complete correlation between the experi- 
ments, P(B\A)/P(A) = l/P{A) = 8.87. In fact, we find in our model that 
P(B\A)/P(A) = 1.30, showing a modest amount of correlation between the 
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OPL — > APL and APL — > OPL experiments. Experimental measurements of 
this predicted correlation would be very interesting. 



7 Summary 

The random energy model of the T cell immune system presented here allows 
an investigation of the sequence-level evolution that occurs within the T cell 
repertoire. This is remarkable, given the complexity underlying the immune 
response. Correlations in the immune response to mutated antigens are cap- 
tured by the non-linear susceptibility of the underlying random energy model 
to structural changes. Specific lysis curves for the model virus LCMV are well 
predicted by the theory. The immune response probability is shown to vary in 
a systematic fashion with the effector to target ratio, and more experimental 
measurements of this quantity are needed. 

The effective design of vaccines for viral diseases requires some estimation 
of the likely escape mechanisms of the virus. For example, for rapidly mu- 
tating strains, a highly-diverse, multicomponent vaccine may be necessary to 
halt progression and transmission of the virus. Conversely, for slowly evolv- 
ing strains, diversity within the vaccine may simply dilute the conference of 
protective immunity. The approach taken here allows investigation and deter- 
mination of the fundamental qualitative and quantitative features that govern 
the interaction between an effective multicomponent vaccine and the variabil- 
ity of the virus. The theory complements and may provide some guidance to 
the long and difficult process of experimental multicomponent vaccine devel- 
opment against escaping viral diseases. 
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Fig. 1. Figure of the interaction between the TCR and the peptide-MHCI com- 
plex. The TCR consists of two domains, a. and /3, each of which contains three 
complementary determining regions (CDRs) that interact with the peptide-MHCI 
complex. The MHCI complex consists of two domains, a and (3, and the a domain is 
comprised of two halves, a\ and a<i- a) Top view of the peptide-MHCI complex. The 
peptide (gray) sits like a "hotdog in a bun" in the a\ and «2 domains of the MHCI 
complex (black), b) Side view of CDR 1 (gray) from TCR domain a interacting 
with the first few amino acids of the peptide and the MHCI and of CDR 1 (gray) 
from TCR domain (3 interacting with the last few amino acids of the peptide and 
the MHCI. The TCR (light gray) sits on top of the peptide-MHCI complex. The 
peptide sites between the TCR and MHCI complex, c) CDR 2 (gray) from TCR 
domain a interacts with the first few amino acids of the peptide and the MHCI, and 
CDR 2 (gray) from TCR domain (3 interacts with the last few amino acids of the 
peptide and the MHCI. d) CDR 3 (gray) from both TCR domains interacts with 
the middle few amino acids of the peptide. Atomistic details of this interaction are 
from an X-ray crystal structure ([25], http:/ /www.rcsb.org/pdb/, accession number 
1A07). The interactions between the peptide and the TCR are represented by the 
last two terms of Eq. 1. The TCR itself must fold, and these terms are represented 
by the first two terms of Eq. 1. The interactions between the TCR and the MHCI 
complex have been integrated out in the model. 
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Fig. 2. Specific lysis as a function of effector to target ratio for repertoire sizes in 
the range 10 3 to 10 6 for non-conservatively altered peptide ligands. The theoretical 
curves overlap for all repertoire sizes. Results are for ex vivo (upper) and in vitro 
(lower). Experimental data (circles) are from the LCMV mouse models of [23] (ex 
vivo) and [26] (in vitro). 
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Fig. 3. Specific lysis as a function of effector to target ratio for repertoire sizes in 
the range 10 3 to 10 6 for conservatively altered peptide ligands. Results are for ex 
vivo (upper) and in vitro (lower). Experimental data (circles) are from the LCMV 
mouse models of [22] (ex vivo) and [27] (in vitro). 
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Fig. 4. Immune response probability as a function of effector to target ratio for 
repertoire sizes of 10 3 , 3 x 10 3 , 10 4 , 3 x 10 4 , 10 5 , and 3 x 10 5 for non-conservatively 
altered peptide ligands. Larger repertoire sizes lead to a greater slope at the inflec- 
tion point of the curve. Results are for ex vivo (upper) and in vitro (lower). Exper- 
imental datum (horizontal dashed line) is taken from the LCMV mouse model of 
[22] (live mouse, in vivo). 
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Fig. 5. Immune response probability as a function of effector to target ratio for 
conservatively altered peptide ligands. Repertoire sizes as in Figure 4. Results are 
for ex vivo (upper) and in vitro (lower). 



1 




*==^ — i — r 1 1 — i 

1 10 100 

Effector to target ratio 

Fig. 6. Representative specific lysis curves for a forward and backward altered 
peptide ligand experiment. Data from experiments with LCMV-WE original ligand 
and LCMV-8.7 altered peptide ligand (circles) and LCMV-8.7 original ligand and 
LCMV-WE altered peptide ligand (squares) from the in vitro LCMV mouse model 
of [22]. 
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